Sequence similarity in structurally dissimilar proteins
نویسندگان
چکیده
It is a central tenet of protein evolution that the threedimensional structure of a protein family is better conserved than the sequences themselves. So structural similarities between related proteins should be detectable over longer evolutionary distances and be most useful for the functional prediction of proteins without closely related homologs. Comparative sequence and structural analysis of prokaryotic phospholipase A2 (PLA2) from Streptomyces violaceoruber reveals a clear violation of this dogma. While the enzymatic properties of this enzyme match closely those of eukaryotic secretory PLA2s, the corresponding structures appear very different, and no appreciable structural similarity is detected by a number of comparison methods. Nevertheless, contemporary sequence analysis methods demonstrate a highly significant relationship between the sequences of both families and provide a structurally correct alignment of the active site residues. The recent literature documents many cases of unexpected structural conservation in the complete absence of detectable sequence relatedness. Although in many of those cases the purported lack of sequence similarity does not stand closer scrutiny with sophisticated sequence comparison methods, there clearly are some impressive examples of structure-based functional predictions where sequence comparisons fail [1]. Here, we describe an unusual case where this situation appears to be reversed: related proteins with an unambiguously detectable sequence similarity but whose structures have diverged beyond recognition. Secretory (s)PLA2s are a class of small enzymes that hydrolyze the 2-acyl ester bond of 1,2diacylglycero-3-phospholipids [2]. Most eukaryotic cells produce multiple sPLA2 isoenzymes, and several poisonous animals, including bees and snakes, use members of this enzyme class as a major component of their toxins. All secreted PLA2 forms contain Ca(II) ions, which are directly involved in the catalytic reaction [2]. Recently, the first prokaryotic PLA2 has been isolated and cloned from the bacterium S. violaceoruber, and subsequently was structurally characterized [3,4]. The enzymatic properties of bacterial PLA2 resemble closely that of the eukaryotic sPLA2 forms, including the strict requirement for Ca(II). But the lack of visible sequence similarity and the fundamentally different structural fold have been interpreted as indicating a distinct evolutionary origin [3]. As indicated in the original report on cloning Streptomyces PLA2, sequence database searches with standard methods like BLAST [5] reveal significant similarities to a number of uncharacterized proteins from other bacteria, but fail to show a relationship to established PLA2 forms or to other hydrolytic enzymes. To search for more distant sequence relatives, we constructed generalized profiles [6] from a multiple alignment of the bacterial PLA2 and its reliable BLAST matches (Figure 1, upper part). Unexpectedly, the result of the profile search clearly demonstrates a significant relationship to a number of established PLA2s: The best match (p<0.01) was the conodipine-M α chain from cone snails, a well-characterized PLA2 toxin [7]. Among the next seven high-scoring sequences, six corresponded to known eukaryotic PLA2s, including the group XIV enzyme from Drosophila and the mammalian group XIII enzymes. The seventh sequence was an uncharacterized Pseudomonas protein, another likely PLA2. The best-scoring non-PLA2 was the MAP kinase ERK4, which reached an insignificant p value of only 0.6. As expected for profile searches, the significance values of other eukaryotic PLA2 sequences further improved after incorporating conodipine-M in a subsequent cycle of iterative profile refinement. As shown in Figure 1, two classes of residues are nearly invariant between prokaryotic and eukaryotic PLA2 isoenzymes: two cysteine residues that form a structurally important disulfide bridge, and the polar residues required for catalysis and the coordination of one Ca(II) ion. The eukaryotic PLA2 enzymes typically Figure 1. Alignment of prokaryotic and eukaryotic sPLA2 families. Multiple alignment of the prokaryotic PLA2 family and representative members of the eukaryotic sPLA2 families for comparison. Sequence names are in the leftmost column: PLA2 from S. violaceoruber and S. coelicolor; SCO01048 from S. coelicolor; Pst1 from Tuber borchii; RHSA from S. clavuligerus; p15 from Helicosporium sp; CGL2546 from Corynebacterium glutamicum; ORF from Neurospora crassa; and PLA2s from cone snail, cobra, bee, and various human isoenzymes. Only the most conserved part of the catalytic domain is shown. The position of the fragment relative to the complete sequence is indicated by numbers. Residues invariant or conservatively replaced in at least 50% of the sequences are printed on black or grey background, respectively. The catalytically important residues conserved between all PLA2 subtypes are printed on red background. Strep viol 64 AYEFDWSTDLCTQAPDNP.........FGFPFNTACARHDFGYRNYKA.........AGSFDANKSRIDSAFYEDMKRVCTGYT Strep 1048 63 AYGFDWTTDYCSSSPDNP.........FGFPFNTSCARHDFGYRNYKD.........AGTFSANKSRLDSAFYEDLKRVCAGYG Tuber borc 118 PGNLDWSDDGCSKSPDRP.........AGFNFLDSCKRHDFGYRNYKK........QHRFTEANRKRIDDNFKKDLYNECAKYS Strep RHSA 600 RGDLVWSDDGCSAPWYSHIVIGPSVGYYSGQFYWPCARHDFGYRNYRK........QNRRTRANKDKIDNRFRYDMKKRICEPK Helicospor 59 PSTLDWSSDGCSSSPDDP.........FGFDFLSSCHRHDFGYRNYKK........QNRFTAPNKARIDTNFKTDMYNQCNTES Corynebact 447 DPDAYGRHDYCTLSPDSY.....GPLGKKAEFSGACARHDLCMDAVDA............NGTGYAPCHPAFYTWMSTVCTTNY Neurospora 65 PATLDWSSDSCSYSPDNP.........LGFPFSPACNRHDFGYRNYKA........QSRFTDNNKLKIDGNFKTDLYYQCDTHG
منابع مشابه
Sequence-similar, structure-dissimilar protein pairs in the PDB
It is often assumed that in the Protein Data Bank (PDB), two proteins with similar sequences will also have similar structures. Accordingly, it has proved useful to develop subsets of the PDB from which "redundant" structures have been removed, based on a sequence-based criterion for similarity. Similarly, when predicting protein structure using homology modeling, if a template structure for mo...
متن کاملThe 1.7 A crystal structure of BPI: a study of how two dissimilar amino acid sequences can adopt the same fold.
We have extended the resolution of the crystal structure of human bactericidal/permeability-increasing protein (BPI) to 1.7 A. BPI has two domains with the same fold, but with little sequence similarity. To understand the similarity in structure of the two domains, we compare the corresponding residue positions in the two domains by the method of 3D-1D profiles. A 3D-1D profile is a string form...
متن کاملProteins of the same fold and unrelated sequences have similar amino acid composition.
It is well established that there is a relationship between the amino acid composition of a protein and its structural class (i.e., alpha, beta, alpha + beta, or alpha/beta). Several studies have even shown the power of amino acid composition in predicting the secondary structure class of a protein. Herein, we show that significant similarity in amino acid composition exists not only between pr...
متن کاملGlimmers in the Midnight Zone: Characterization of Aligned Identical Residues in Sequence-Dissimilar Proteins Sharing a Common Fold
Sequence comparison of proteins that adopt the same fold has revealed a large degree of sequence variation. There are many pairs of structurally similar proteins with only a very low percentage of identical residues at structurally aligned positions. It is not clear whether these few identical residues have been conserved just by coincidence, or due to their structural and/or functional role Th...
متن کاملA Boost for Sequence Searching Extension to BLAST Can Improve its Sensitivity Twofold
descended from a common ancestor, usually not only have similar sequences but also similar structures and functions. Hence, when two sequences are similar to a degree that cannot be explained by chance, we can assume that the similarity in sequence arose by common descent and that the proteins are therefore likely to be structurally and functionally similar. This principle of “homology-based in...
متن کاملStructural Similarity and Classification of Protein Interaction Interfaces
Interactions between proteins play a key role in many cellular processes. Studying protein-protein interactions that share similar interaction interfaces may shed light on their evolution and could be helpful in elucidating the mechanisms behind stability and dynamics of the protein complexes. When two complexes share structurally similar subunits, the similarity of the interaction interfaces c...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Current Biology
دوره 13 شماره
صفحات -
تاریخ انتشار 2003